Safe Strategies for Agent Modelling in Games
Authors
Abstract
Research in opponent modelling has shown success, but a fundamental question has been overlooked: what happens when a modeller is faced with an opponent that cannot be successfully modelled? Many opponent modellers could do arbitrarily poorly against such an opponent. In this paper, we aim to augment opponent modelling techniques with a method that enables models to be used safely. We introduce ε-safe strategies, which bound by ε the possible loss versus a safe value. We also introduce the Safe Policy Selection algorithm (SPS) as a method to vary ε in a controlled fashion. We prove that, in the limit, an agent using SPS is guaranteed to attain at least a safety value in the cases when the opponent modelling is ineffective. We also show empirical evidence that SPS does not adversely affect agents that are capable of modelling the opponent. Tests with a domain of complicated modellers show that SPS is effective at eliminating losses while retaining wins in a variety of modelling algorithms.
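To make the idea of bounding the loss versus a safety value concrete, the following is a minimal sketch and not the paper's SPS algorithm. It assumes a zero-sum matrix game (rock-paper-scissors, whose safety value for the row player is 0) and checks whether a mixed strategy's worst-case expected payoff stays within ε of that value. The payoff matrix and the helper names `worst_case_value`, `is_epsilon_safe`, and `mix` are hypothetical and chosen only for illustration.

```python
# Hypothetical illustration: the game, names, and numbers are assumptions,
# not taken from the paper.

# Row player's payoffs in rock-paper-scissors
# (rows: our action, columns: opponent's action).
RPS = [
    [0, -1, 1],   # rock     vs rock, paper, scissors
    [1, 0, -1],   # paper
    [-1, 1, 0],   # scissors
]


def worst_case_value(strategy, payoffs):
    """Expected payoff of a mixed strategy against the most damaging opponent action."""
    n_cols = len(payoffs[0])
    return min(
        sum(p * row[col] for p, row in zip(strategy, payoffs))
        for col in range(n_cols)
    )


def is_epsilon_safe(strategy, payoffs, safety_value, epsilon, tol=1e-9):
    """True if the strategy can lose at most epsilon relative to the safety value."""
    return worst_case_value(strategy, payoffs) >= safety_value - epsilon - tol


def mix(safe, exploit, alpha):
    """Blend a safe strategy with an exploiting one; alpha is the weight on exploitation."""
    return [(1 - alpha) * s + alpha * e for s, e in zip(safe, exploit)]


uniform = [1 / 3, 1 / 3, 1 / 3]   # minimax strategy, guarantees the safety value 0
always_paper = [0.0, 1.0, 0.0]    # best response if a model predicts "rock"
mixed = mix(uniform, always_paper, 0.25)
```

Here `mixed` has a worst-case expected payoff of -0.25 (when the opponent plays scissors), so it is ε-safe for ε = 0.25 but not for ε = 0.1, whereas playing `always_paper` outright could lose a full point if the model is wrong. Shrinking the weight on the exploiting strategy tightens the bound, which is the kind of trade-off a controlled choice of ε is meant to manage.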
Similar resources
Artificial Intelligence Techniques in Games with Incomplete Information: Opponent Modelling in Texas Hold'em
Games have been widely used as an application for Artificial Intelligence techniques because of their simplicity and well-defined rules but also, on the other hand, for their huge range of possible and complex strategies to reach the final objective. In recent years, the study of Artificial Intelligence applied to games has focused more on games with incomplete information and non-deterministic games wher...
Multiagent Reinforcement Learning in Stochastic Games with Continuous Action Spaces
We investigate the learning problem in stochastic games with continuous action spaces. We focus on repeated normal form games, and discuss issues in modelling mixed strategies and adapting learning algorithms in finite-action games to the continuous-action domain. We applied variable resolution techniques to two simple multi-agent reinforcement learning algorithms PHC and MinimaxQ. Preliminary ...
Spatio-temporal agent based simulation of COVID-19 disease and investigating the effect of vaccination (case study: Urmia)
Proper management of epidemic diseases such as COVID-19 is very important because of their effects on the economy, culture and society of nations. By applying various control strategies such as closing schools, restricting night traffic and running mass vaccination programs, the spread of this disease has been somewhat controlled but not completely stopped. The main goal of this research is to provide a f...
Nash Equilibrium Strategy for Bi-matrix Games with L-R Fuzzy Payoffs
In this paper, bi-matrix games are investigated based on L-R fuzzy variables. Also, based on the fuzzy max order, several models in a non-symmetrical L-R fuzzy environment are constructed and the existence condition of Nash equilibrium strategies of the fuzzy bi-matrix games is proposed. Finally, based on the Nash equilibrium of crisp parametric bi-matrix games, we obtain the Pareto and weak Pareto...
Multiplayer Games and Competitive Business Models
The paper is based on agent plan computing where the interaction amongst heterogeneous computing resources is via objects, multiagent AI and agent intelligent languages. Modeling, objectives, and planning issues are examined at an agent planning. A basis to model discovery and prediction planning is stated. The new AI agent computing business bases defined during the last several years can be a...